Test eval workflow (Do not merge) #124

pamelafox · 2024-10-23T23:55:22Z

No description provided.

pamelafox · 2024-10-23T23:55:29Z

/evaluate

github-actions · 2024-10-23T23:55:42Z

Starting evaluation! Check the Actions tab for progress, or wait for a comment with the results.

github-actions · 2024-10-23T23:58:35Z

Evaluation results

metric	stat	baseline	pr124
gpt_groundedness	pass_rate	1.0	1.0
↑	mean_rating	5.0	5.0
gpt_relevance	pass_rate	1.0	1.0
↑	mean_rating	5.0	5.0
answer_length	mean	1017.6	995.8
latency	mean	2.56	1.94
citations_matched	mean	0.73	0.73

Check the workflow run for more details.
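
To make the comparison in the results table easier to read, here is a minimal Python sketch that computes the absolute and percent change of the pr124 column against the baseline column. The numbers are copied from the table. The `ROWS` list and the `deltas` helper are hypothetical names introduced here for illustration and are not part of the evaluation workflow. The sketch also assumes that each `↑` row belongs to the metric named on the row above it.

```python
# Rows copied from the results table: (metric, stat, baseline, pr124).
# Assumption: a "↑" row in the table refers to the metric on the row above it.
ROWS = [
    ("gpt_groundedness", "pass_rate", 1.0, 1.0),
    ("gpt_groundedness", "mean_rating", 5.0, 5.0),
    ("gpt_relevance", "pass_rate", 1.0, 1.0),
    ("gpt_relevance", "mean_rating", 5.0, 5.0),
    ("answer_length", "mean", 1017.6, 995.8),
    ("latency", "mean", 2.56, 1.94),
    ("citations_matched", "mean", 0.73, 0.73),
]


def deltas(rows):
    """Return {(metric, stat): (absolute_change, percent_change)}."""
    out = {}
    for metric, stat, baseline, pr in rows:
        change = pr - baseline
        # Guard against division by zero for a zero baseline.
        pct = 100.0 * change / baseline if baseline else float("nan")
        out[(metric, stat)] = (round(change, 2), round(pct, 1))
    return out


if __name__ == "__main__":
    for (metric, stat), (change, pct) in deltas(ROWS).items():
        print(f"{metric:20} {stat:12} {change:+8.2f} {pct:+6.1f}%")
```

On these numbers, only `answer_length` and `latency` differ between the two columns: the mean answer length drops by 21.8 (about 2.1%) and the mean latency drops by 0.62 (about 24.2%). The other rows are unchanged.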

Test eval workflow (87f15f6)

pamelafox closed this Oct 24, 2024
